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SOME  BADLY  BEHAVED  CLOSED  QUEUEING 
NETWORKS 

J.   MICHAEL  HARRISON   AND  VIEN   NGUYEN* 

Abstract.  We  study  several  variations  of  a  multiclass  closed  queueinp  network 
model,  all  closely  related  to  the  open  priority  network  of  Lu  and  Kumar.  Foi  each  of  our 
examples  it  is  shown  that,  as  the  population  size  n  — •  oo,  the  long-run  utilization  rate 
of  each  server  approaches  a  limit  strictly  less  than  one.  Thus  no  "bottleneck  station" 
emerges  in  the  heavy  traffic  limit.  This  is  analogous  to  the  "bad  behavio' '  of  open 
network  examples  developed  by  Bramson  and  by  Lu  and  Kumar,  among  others,  in 
which  queue  sizes  grow  without  bound  even  though  each  service  static^.;  has  a  traffic 
intensity  parameter  less  than  one.  Two  of  the  three  examples  considered  here  have 
preemptive-resume  priority  service  disciplines  at  each  station,  as  in  the  Lu-Kumar  open 
network  example,  and  one  has  FIFO  service  at  each  station,  as  in  Bramson's  open 
network  examples. 

Key  words.  Closed  queueing  networks,  multiclass  networks,  priority  networks, 
bottlenecks 

AMS(MOS)  subject  classifications.   9OB15,60K25,60K20,60K05 

1.  Introduction.  Over  the  last  few  years  queueing  theorists  have 
been  shocked  to  discover  that  there  exist  open  network  models  which  are 
unstable  even  though  each  service  station  has  more  than  enough  capacity 
10  handle  the  load  imposed  on  it.  Perhaps  the  simplest  example  is  the 
two-station  priority  network  pictured  in  Figure  1.1,  which  was  originally 
studied  by  Lu  and  Kumar  [3].    Here  customers  arrive  accordincr  to  a  re- 
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Classes  4  and  2  have  priority  al  stations  1  and  2,  respectively 
Fig.  1.1.   The  Lu-Kumar  priority  netwoTk 


newal  process  at  average  rate  A,  and  they  follow  a  fixed  deterministic  route 
requiring  four  service  operations.    The  four  operations  are  performed  at 
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stations  1,  2,  2  and  1  successively.  Customers  that  are  waiting  for  or  un- 
dergoing their  fc"*  operation  will  be  called  class  k  customers.  Each  class  is 
allowed  to  have  its  own  service  time  distribution  (thus  there  are  four  mu- 
tually independent  service  time  sequences,  also  independent  of  the  arrival 
process),  and  we  denote  by  rufc  the  mean  service  time  for  class  k.  Finally, 
there  is  a  preemptive- resume  priority  service  discipline  at  each  station,  the 
priorities  being  as  shown  in  Figure  1.1. 

In  their  original  paper,  Lu  and  Kumar  [3]  treated  a  version  of  this 
model  with  deterministic  interarrival  and  service  times.  Assuming  specific 
numerical  values  for  A,  mi, . . .  and  7714,  they  showed  that  queue  lengths  can 
grow  without  bound  in  the  deterministic  priority  network  even  though 

(1.1)  pi  =  \{mi  +  m^)  <  \     and     p2  = -^("^2  +  "^3)  <  1- 

Independently  of  this  work,  Rybko  and  Stolyar  [4]  analyzed  the  first  legiti- 
mately stochastic  network  model  shown  to  be  unstable  with  traffic  intensity 
parameters  less  than  one.  In  an  important  recent  paper,  Dai  and  Weiss  [2] 
studied  the  stochastic  version  of  the  Lu-Kumar  model  described  above,  and 
discovered  the  following  additional  condition: 

(1.2)  A(m2-fm4)<l. 

Dai  and  Weiss  [2]  showed  that  (1.1)  and  (1.2)  together  are  sufficient  for 
stability  ;^;hat  is,  positive  Harris  recurrence)  of  the  stochastic  Lu-Kumar 
network.  'I hey  also  showed  that  (1.2)  is  necessary  for  the  network's  de- 
terministic fluid  analog  to  be  stable,  which  strongly  suggests  that  (1.2)  is 
necessary  for  stability  of  the  queueing  network  as  well. 

It  may  be  argued  that  "legitimate"  networks  can  be  made  unstable  by 
applying  a  "bad"  priority  scheme,  such  as  those  used  in  the  Lu-Kumar 
and  Rybko-Stolyar  examples.  Bramson  [1]  recently  presented  the  surpris- 
ing result  of  a  multiclass  network  that  is  unstable  even  when  its  intensity 
parameters  are  less  than  one  and  stations  serve  customers  on  a  FIFO  ba- 
sis. Whitt  [7]  and  Seidman  [6]  also  studied  unstable  deterministic  networks 
with  FIFO  service. 

Can  a  closed  queueing  network  exhibit  "bad  behavior"  analogous  to 
the  instability  described  above  for  open  networks?  In  this  paper  we  answer 
that  question  in  the  affirmative,  as  follows.  We  consider  several  closely  re- 
lated two-station  closed  networks,  denoting  by  n  the  fixed  population  size, 
and  show  that  for  each  of  the  examples  considered,  the  long-run  utilization 
rates  of  both  servers  approach  limits  strictly  less  than  one  as  n  — >  oc  Thus 
no  "bottleneck  station"  emerges  as  the  population  grows  in  size,  although 
the  existence  of  such  a  bottleneck  has  often  been  implicitly  assumed  in 
theoretical  analyses  of  closed  networks  [5].  Readers  should  note  that  our 
"badly  behaved"  closed  network  examples  are  true  multiclass  r.etworks, 
with  different  classes  served  at  the  same  station  having  different  moan  ser- 
vice times,  whereas  most  previous  work  on  both  open  and  closed  networks 
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Fig.  1.2.   Structure  of  closed  network  examples  to  be  studied 

has  assumed  a  single  service  time  distribution  for  all  classes  served  at  a 
station. 

To  be  more  specific,  we  will  focus  on  the  closed  version  of  the  Lu-Kumar 
model  pictured  in  Figure  1.2,  with  two  single-server  stations,  four  customer 
classes,  and  the  deterministic  cyclic  routing  shown.  Two  very  simple  ex- 
amples are  treated  in  the  next  section,  each  characterized  by  extreme  dis- 
tributional assumptions  (some  service  times  are  small  and  bounded  above, 
others  are  large  and  bounded  below),  which  eliminate  the  need  for  sophis- 
ticated analysis.  The  first  of  our  elementary  examples  uses  the  Lu-Kumar 
preemptive- resume  priority  scheme  (see  Figure  1.1),  and  our  second  ex- 
ample shows  that  essentially  the  same  system  behavior  may  be  observed 
with  FIFO  scheduling,  at  least  if  one  allows  some  service  times  to  vanish 

as  71  — >  CXD. 

In  section  3  we  study  a  precise  closed  analog  of  the  Lu-Kumar  open 
network,  with  preemptive-resume  priority  scheduling  at  each  station,  with 
all  four  service  time  distributions  assumed  to  be  exponential,  and  with 
mean  service  times  assumed  to  satisfy 


(1.3) 


7712-1-7714    >  max(77Xl   -|-  1714,1712  +  rn^) . 


Thus  our  final  example,  unlike  those  treated  in  section  2,  is  not  character- 
ized by  exotic  distributional  forms  or  extreme  parameter  values,  although 
one  realizes  after  the  fact  that  the  priority  rule  assumed  is  a  rather  bad 
one.  The  limiting  long-run  average  throughput  rate  (as  7i  — *  oo)  is  ex- 
plicitly calculated,  and  this  corresponds  to  less-than-full  utilization  of  each 
server. 


2.   Two  simple  examples. 

Priority  service.  With  the  routing  picture  in  Figure  1.2,  first  assume 
that  classes  4  and  2  have  preemptive- resume  priority  at  stations  1  and  2,  re- 
spectively. Let  {vk{i),i  =  1,2, . . .},  A:  =  1,2,3,4,  be  independent  sequences 
of  i.i.d  non-negative  random  variables  representing  processing  times  of  class 
k  jt'bs.  We  suppose  that  for  some  6  >  0  the  following  inequalities  hold  al- 


4  Harrison  and  Nguyen 

most  surely: 

,  l'2(l)       >      <5  1^4(1)       >       <5 

^    ^  v^{l)     <     6  i;3(l)     <     6. 

These  bounds  imply  (recall  mt  =  E[?;fc(l)]) 

(2.2)  m£Lx(mi,m3)  <  min(m2,  7/14). 

Suppose  that  all  n  customers  are  queued  initially  at  station  i  as  class 
1  (i.e.,  Qi(0)  =  n  and  Q2(0)  =  QsW  =  QaW  =  0),  so  that  station  1 
contains  only  low  priority  work.  Set 

(2.3)  To     =     inf{«  >  0:gi(i)  =  n}  =  0     and 

(2.4)  T,+i     =     mf{t>Ti+6:Q^{l)  =  n}. 

It  follows  from  the  bounds  on  service  times  (2.1)  that  while  station  1  is 
processing  low  priority  work,  each  class  1  customer  will  depart  station  1 
and  enter  station  2  before  the  previous  customer  can  complete  service  at 
station  2  Consequently,  assuming  that  station  1  is  not  interrupted  by 
arrival  of  a  high  priority  (class  4)  customer,  station  2  will  be  continuously 
busy  with  class  2  work  after  the  initial  service  of  a  class  1  customer.  Let 
T  be  the  sum  of  the  first  class  1  service  at  station  1  and  the  first  n  class  2 
services  at  station  2.  Because  of  the  preemptive-resume  priority  policy  at 
station  2,  no  class  4  customers  are  generated  between  7o  and  Tq  +  r  and 
so  at  lirne  To  +  t  all  n  customers  are  queued  at  station  2  as  low  priority 
class  N  \/ork. 

The  network  now  evolves  in  exactly  a  symmetrical  manner  as  before. 
Services  of  high  priority  (class  4)  customers  at  station  1  begin  ;<iter  the 
intial  service  of  a  class  3  customer,  and  station  1  continuously  serves  class 
4  until  all  n  customers  have  been  served,  at  which  time  the  entire  population 
is  again  queued  at  station  1  as  class  1. 

It  should  be  clear  from  this  discussion  that 

E[Ti]  =  n(m2  +  1714)  +  {mi  +  1713)  . 

Moreover,  Ti,  T2, . . .  form  regeneration  points  for  the  queue  length  process 
QU)  '-'-  {Q\{t),Q2it),Q3{t.),Q4{t)].  Denoting  by  A  be  the  long-run  through- 
put rate  of  customers,  it  follows  from  the  theory  of  regenerative  processes 
that 

E[number  of  customers  exiting  station  1  between  Tq  and  Ti] 

(2-5)  =  -^— — " ,    ^ — -,, 

n(m2  +  m^)  +  (mi  +  m^) 
and  ve  arrive  at  the  main  result  of  this  subsection: 

(2.6)  A  —>  as  n  — »  00. 

m-z  +  m4 


«2(1) 

> 

8 

v,{\) 

> 

5 

^^l(l) 

< 

6/n 

V3{1) 

< 

S/n, 
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Equations  (2.2)  and  (2.6)  together  imply  tliat  the  long-run  utilization  rate 
for  station  1  is  given  by 

.                 .        mi  + 1714 
X(Tni  +  1714)  ~*  <  1     as  n  — »  oo, 

7712  +  1^4 

and  similarly  for  station  2,  the  long-run  utilization  rate  is 

\  /  ,  \  ^17.2  +  1713 

A[m2  +  1713)  —* <  1     as  n  — >  oo. 

Tn2  + 1714 

FIFO  service.  We  now  consider  a  FIFO  version  of  the  network  dis- 
cussed in  the  previous  subsection.  The  routing  of  customers  remains  ex- 
actly the  same  as  shown  in  Figure  1.2  and  customers  are  served  on  a  FIFO 
basis  at  each  station.  Using  the  same  notation  as  before,  let  us  now  impose 
the  following  bounds  on  service  times: 

(2.7) 

which  of  course  implies  max(Tni,m3)  <  min(m2,m4).  We  will  find  it  con- 
venient to  refer  to  classes  1  and  3  as  customers  requiring  "short"  services 
and  classes  2  and  4  as  those  requiring  "long"  services.  Let  us  again  assume 
initial  conditions  Qi(0)  =  n  and  Q2(0)  =  QslO)  =  QiiO)  =  0;  that  is, 
all  customers  are  initially  queued  at  station  1  as  class  1  awaiting  a  short 
sei  'ice.  From  (2.7),  one  can  verify  that  station  1  will  finish  s(-iving  all  n 
clas.s  1  customers  (that  is,  all  n  customers  will  arrive  to  station  2)  before 
station  2  completes  its  first  long  service.  Thus  all  n  class  2  services  must  be 
completed  at  station  2  before  the  first  class  3  service  can  begin  Defining 
T  exactly  as  above,  we  find  that  at  time  t  all  n  customers  are  queued  at 
station  2  as  class  3.  One  may  then  argue  exactly  as  above  to  conclude  that 
(2.5)  remains  valid  for  a  single  FIFO  system  satisfying  (2.7).  In  letting 
n  — t  00,  we  may  keep  the  service  time  distributions  for  classes  2  and  4 
(the  long  service  operations)  fixed,  but  (2.7)  demands  that  service  times 
for  classes  1  and  3  vanish,  implying  that  mi,  ms  — +  0.  Thus  (2.6)  remains 
valid  in  the  current  setting,  and  the  long-run  utilization  rates  at  stations  1 
and  2,  respectively,  converge  as  n  — »  00  to 

m,4  ,  ,  mo 

<  1     and <  1. 


1712  +  m.4  7712  +  'm4 

3.  A  closed  version  of  the  Lu-Kumar  priority  network.  With 
the  routing  portrayed  in  Figure  1.2,  let  us  return  to  the  assumption  of 
preemptive-resume  priority  scheduling,  with  classes  4  and  2  having  priority 
at  stations  1  and  2,  respectively,  as  in  the  Lu-Kumar  open  network  (see 
Figure  1.1).  Also,  let  all  four  service  time  distributions  be  exponential. 
With  mean  service  times  satisfying  (1.3)  we  can  (and  do)  choose  units  so 
that 

(3.1)         7712  +  'n'l-i  =  1     but     m-i  +  m.4  <  1     and     1712  +  1713  <  1. 
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With  exponentially  distributed  service  times  and  preemptive  resume  prior- 
ity scheduling  the  queue  length  process  Q{t)  =  \Qi{t),Q2{t),Q3{t),Q4(t)] 
is  a  Markov  process  with  finite  state  space,  but  some  of  its  states  are  tran- 
sient, as  explained  below.  Suppose  that  both  (52(0)  >  0  and  Q4{0)  >  0. 
Then  both  servers  will  work  on  priority  customers  at  their  respective  sta- 
tions over  an  initial  interval  [0,  t],  at  the  end  of  which  either  Q2{t)  =  0 
or  Q^ir)  =  0.  Moreover,  no  new  priority  customers  are  created  during  this 
initial  period,  so  it  is  immediate  that  E{t)  <  oo. 

Suppose  for  the  sake  of  concreteness  that  Q2iT)  >  0  and  Q4{t)  =  0. 
Then  server  2  will  work  on  (priority)  customers  of  class  2  over  an  interval 
(r,  t'J,  during  which  period  no  new  customers  of  class  4  can  be  created. 
Thus  server  1  is  either  working  on  class  1  or  else  idle  throughout  the  pe- 
riod, and  at  the  end  we  have  Q2(''"')  =  Qii'''')  =  0-  The  evolution  of  the 
system  after  time  t'  can  be  divided  into  alternating  "idle  periods"  and 

"busy  periods"  of  length  Ii,Bi,  l2,B2, Each  idle  period  begins  with 

Q2  =  Qa  =  0,  contains  a  single  service  completion,  and  ends  with  either 
Q2  =  I  OT  Q4  =  \  but  not  both.  Using  the  memoryless  property  of  the 
exponential  distribution  we  have  that 

(3.2)  aj  =  £'(/j)  <  a  =  max(mi,m3)     for  all     j  =  l,2, ... 

Of  course,  the  expectation  in  (3.2)  depends  on  the  particular  initial 
condition  Q{0)  assumed,  but  we  suppress  that  dependence  to  simplify  no- 
tation, and  the  inequality  in  (3.2)  is  understood  to  hold  regarriless  of  initial 
state. 

A  "busy  period"  begins  with  one  of  the  two  servers  having  a  single 
priority  customer  to  work  on  (there  may  be  non-priority  customers  present 
at  one  or  both  stations),  and  it  ends  as  soon  as  that  server  runs  out  of 
prior  ty  customers.  Because  of  the  preemptive-resume  prioritier-  and  the 
routing  pictured  in  Figure  1.2,  no  priority  customers  are  created  for  the 
other  server  during  such  a  busy  period,  so  it  ends  with  Q2  =  Qi  =  0 
once  again.  The  key  to  our  analysis  is  the  following  bound,  whose  proof  is 
postponed  until  the  end  of  this  section. 

Lemma  3.1.  There  exists  a  constant  c  >  0,  not  depending  on  n,  such 
that 

(3.3)  bj  =  E[Bj)>cn     for  all     j  =  l,2,.... 

The  transient  states  q  =  (gi,  92,  "Ja,  94)  referred  to  earlier  are  those  with 
92  >  0  and  q4  >  0.  As  we  have  seen,  such  states  cannot  be  revisited  after 
the  initial  interval  [0,  r],  but  all  other  states  of  the  Markov  chain  Q(t) 
communicate  and  hence  there  is  a  unique  stationary  distribution.  Let  us 
denote  by  7r2  the  stationary  probability  of  those  states  q  with  92  >  0,  by 
■K4  the  stationary  probability  of  those  states  q  with  174  >  0,  and  by  ttis  the 
stationary  probability  of  those  q  with  q\  +  q^  =  n  (that  is,  92  =  94  =  0). 
From  the  path  decomposition  described  above  (an  initial  period  of  finite 


expected  duration,  followed  by  alternating  idle  and  busy  periods,  each  with 
finite  expected  duration)  it  follows  that 

(3.4)  7r2  +  7r4  +  7ri3  =  1. 

Moreover,  with  aj  =  E{Ij)  and  bj  =  E{B-i)  as  in  (3.2)  and  (3.3),  we 
have  that 


(3.5)  7ri3  =  lim 


En 


Of  course,  two  equivalent  expressions  for  the  long-run  throughput  rate 
A  are  (here  /x/t  =  ^/rrik) 


(3.6)  A  = /i2'iT2     and     \  =  ^^n^, 

implying  that  A(m2+m4)  =  7r2+7r4.  With  our  convention  that  m2  +  m4  =  1, 
this  and  (3.4)  together  imply 

(3.7)  A  =  (1  -  7ri3)/(Tn2 +m4)  =  1  -  7ri3. 

Together,  (3.2),  (3.3)  and  (3.5)  imply  that  7ri3  — »  0  as  n  — >  cx),  so  we  have 
the  following  major  conclusion. 

Proposition  3.1.  A  -^  1  as  n  — ►  oo. 

Corollary  3.1.  The  long-run  utilization  rate  for  server  1  is  A(m]  -t 
7/14)  ■-►  m\  +7714  <  1  as  n  —*  00.  Similarly,  the  long-run  utilization  rate  for 
server  2  is  \[m2  +  7713)  — >  7n2  +  '^13  <  1  as  n  —*  00. 

It  remains  to  prove  Lemma  3.1.  From  (3.1)  we  have  that 

(3.8)  fii  >  fi2     and     /i3  > /Z4, 

Let  us  suppose  that  the  j"*  busy  period  (its  duration  is  B-,)  begins  with 
the  creation  of  a  class  2  customer,  and  that  k  customers  of  class  '<  remain 
in  existence  after  that  creation  (fc  <  7i  —  1).  Until  the  supply  of  class  1  cus- 
tomers has  been  exhausted,  the  queue  length  process  Q2{t)  is  precisely  that 
of  an  M/M/l  queue  with  input  rate  /ij,  service  rate  fi2,  and  hence  traffic 
intensity  parameter  p  =  f^\/p.2  >  1-  For  such  an  M/M/\  queue  there  is  a 
probability  pi2  >  0  that  a  busy  period  Tiener  ends.  Using  this  and  Wald's 
identity,  we  obtain  a  lower  bound  o{pi2m2k  for  the  conditional  expectation 
of  Bj  given  k.  Similarly,  if  the  i"*  busy  period  begins  with  creation  of  a 
class  4  customer  this  creation  leaving  n  —  k  -  \  class  3  customers  behind, 
then  the  conditional  expectation  of  Bj  is  bounded  below  by  p^i^miin  —  k), 
where  P34  >  0.  It  follows  that  (3.3)  holds  with 

(3.9)  c=  min(pi27n2,P34m4)  >  0- 
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